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We discuss superstatistics theory of labour productivity. Productivity distribution across 
workers, firms and industrial sectors are studied empirically and found to obey power- 
distributions, in sharp contrast to the equilibrium theories of mainstream economics. The 
Pareto index is found to decrease with the level of aggregation, i.e., from workers to firms 
and to industrial sectors. In order to explain these phenomenological laws, we propose a 
superstatistics framework, where the role of the fluctuating temperature is played by the 
fluctuating demand. 
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§1. Introduction 



In mainstream macroeconomics, the marginal productivity is believed to be equal 
across workers, firms, and business sectors, which is identified with nature of equi- 
librium: If the productivity is not equal, there will be profit-opportunity, which, 
according to the standard economic equilibrium theory, is against the notion of equi- 
librium. This, however, is far from physics understanding: When a system made of 
many constituents, such as gas made of molecules, is in an equilibrium state, phys- 
ical quantities, such as energy of the constituents is not unique, but is distributed 
obeying certain statistical laws, such as Boltzmann law. The exchange of energy 
between the constituents do not equalize the energy, but manifests itself in the re- 
alization of the statistical distribution. Keynes' economic is somewhat close to 
our physics view, allowing involuntary unemployment, which leads to distributions, 
although wider perspective may be desired. 

Thus, the study of the distribution of the productivity ***) is an important issue 
in establishing the notion of the economic equilibrium. 

We approach this problem just as in any physics, or rather, any branch of sci- 
ence: We fist study the phenomena itself to find phenomenological laws and then 
will search for theoretical understanding behind them, seeking further tests and re- 



*' Talked presented at the Yukawa Institute workshop "Econophystcs III -Physical approach to 
social and economic phenomena" (YITP-W-07-16) on Dec. 24, 2007. 
**' hideaki. aoyamaOscphys. kyoto-u.ac.jp 

***-* In stead of "distribution", the term "dispersion" is commonly used in Economics. Thus, 
"productivity dispersion" in stead of "productivity distribution". Since this conflicts with normal 
physics usage of "dispersion", we will use the latter word for accuracy. 
^' "Stylized facts" in Economics terminology. 
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finements. 

§2. Phenomenology 

The database we have used is the Nikkei-NEEDS (Nikkei Economic Electronic 
Databank System) databasejH which contains financial data of all the listed firms in 
Japan. As such, it is a well-established and representative database, widely used for 
various purposes from research to practical business applications. For our purpose, 
we used their 2007 CD-ROM version and extracted data for the period between 1980 
and 2006, which contained some 1,700 to 3,000 firms and 4 to 6 million workers, with 
the numbers increasing with the year. 

In the following, we study the labour productivity c; 

y , ^ 

c:=-^, (2.1) 

where Y is the production (in currency) and L is the labour (in number of workers). 
To be exact, this is the mean labour productivity, different from the margmanabour 
productivity dY/dL dealt in the equilibrium theory of the economics. This difference, 
however, does not affect the following discussion of the superstatistics theoryjH due to 
the fact that they obey Pareto distribution as we will see below. Also, we have found 
that the values of c calculated from this database are sometimes inappropriately 
large. Part of this may come from the fact that "the number of employees" reported 
in this database does not contain temporally workers. Also, firms that became stock- 
holding firms reported huge reduction of the number of employees (as it is defined to 
be the value at the end of that year), while maintaining same order of sales revenue 
in the year the conversion occurred. This results in absurd values of the productivity 
c for that year. Because of these abnormalities, we have excluded top-ten firms in 
terms of the productivity each year. This roughly corresponds to excluding firms 
with productivity c > 10^ yen/person. (We have carried out analysis with several 
different cuts, i.e., with cutting top-twenty firms, cutting on the value of c, etc.), 
but the result remained mostly stable.) 

The plots of the cumulative distribution function (cdf ) of the productivity, which 
is defined in terms of the probability density function (pdf ) p{c) as; 

/oo 
p{c) dc, (2-2) 

are given in Figs {l|]3] for three different levels of aggregation in 200511 

As is evident from these figures, the productivity c obeys the Pareto law (power- 
law) in the asymptotic region at all three levels: 

p>{c)occ~^ (c^oo), (2-3) 

The exponent /i is called Pareto index. This qualitative feature is true for all the 
years we have studied. 

Study of the Pareto index calls for a careful analysis: Since the power-law is a 
straight line in the log-log plot of the cdf, it is tempting, at least for a novice, to take 
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Fig. 1. Productivity Distribution across Firms (2005) Notes: The productivity c is in the 
unit of lO^yen/person. The best fits for the exponential law and the power law is obtained for 
fO < c < 3000. 
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Fig. 2. Productivity Distribution across Business Sectors (2005) 
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Fig. 3. Productivity Distribution across Workers (2005) 



the (almost-)straight section of the plot and fit is Unearly, obtaining the Pareto index 
as the gradient of the best-fit straight hne. This is rather dangerous, as its value 
often depends on the choice of the section, which is somewhat arbitrary. Even if a 
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definition of tlie section is good for a particular year, it may become inappropriate 
in later years. Changing the choice of the section from a year to another year will 
destroy objectivity of discussion of the evolution of the Pareto index over the years. 

For these reasons, it is desirable to use a distribution defined for the whole 
region of c E [0, oo) that behaves as a power-law in the asymptotic region. One 
such distribution that is suitable for numerical analysis and is general enough for 
our purpose is the "Generalized Beta Distribution of the Second Kind" (032)13' It 
is defined by the following cdf: 



B{z,^i/q,u/q) 
B{li/q,v/q) 



1 + 



(2-4) 



where /x,z/, g, ci > and B(z,s,t) is the incomplete Beta function with B{l,s,t) 
B{s,t). The corresponding pdf is the following: 



P 



(GB2), 



c) 



1 



For large c this behaves as follows 



B{^i/q,v/q) c \ci 



and its cdf as; 



p. 



(GB2) 



(2-5) 



(2-6) 



(2-7) 



B{fJ-/q,i^/q) Vci, 

Therefore, the parameter /i is the Pareto index and the parameter ci a scale for the 
power law. For c ^ 0, this distribution also reduces to a power law with the power 
exponent equal to u. 

In general, the parameter q determines how persistent two power-laws at both 
ends are; for small q the transition from the small-c power law to the large-c power 
law is smooth. In such a case, it can be approximated by the log-normal distribution 
around its peak at 



Cln 



1/5 



and 



a 



u + 11 



(2-9) 



As the log-normal distribution is widely observed for small-to-medium range in real 
economic distributions, this model-distribution has a good chance of being a valid 
approximation. 

Fitting the data with this OB2 distribution by the maximum likelihood method, 
we have obtained the values of Pareto index plotted in Fig|4] From these, we find 
the following two phenomenological laws. 
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1980 1985 1990 1995 2000 2005 

Fig. 4. Pareto Indices of the Productivity Distributions across Workers, Firms, and 
Industrial Sectors 

I. The distribution of productivity obeys the Pareto distribution {i.e. the power- 
law for the high productivity group) at every level of aggregation, that is, across 
workers, firms, and industrial sectors. 
II. The Pareto index, namely the power exponent decreases as the level of aggre- 
gation goes up: > /"F > At5- 

In the following, we present theoretical framework to explain these laws. 

§3. Yoshikawa-Aoki theory 

Yoshikawa and AokH© proposed an equilibrium theory of productivity distribu- 
tion several years ago. The key ideas are summarized by the following correspondence 
with the statistical physics: 

Economics 44> Physics 

Firms 4^ Energy Levels 
Workers <^ Molecules 
Worker's Productivity 4^ Molecule's Energy 
Aggregate Demand 4^ Total Energy 

The last correspondence may be explained as follows: Let us denote the total 
number of firms by K and each firm is labeled by an index k {k = 1,2,..., K). The 
number of workers nt at the firm k is constrained by the total number of workers A^; 

K 

J2nk = N. (3-1) 

k=l 

The productivity is constrained by the fact that the sum of firm's production is 



6 



Aoyania, Yoshikawa, lyetomi and Fujiwara 



the total production, which is equal to the aggregate demand D; 

K 

Y^nkCk = D. (3-2) 



fc=i 



This constraint (3-1) corresponds to that on the total number of molecules, while 
(3-2) to that on the total energy. 

Yoshikawa and Aoki postulated that the actual distribution is the one that max- 
imized the entropy, arriving at the Boltzmann law for the probability pk of the 
worker's productivity being equal to c^; 

where Z{(3) is the usual partition function: 

K 

Z{P):=Y,^-P''- (3-4) 

fc=i 

The inverse-temperature f3 is determined by the mean demand D as follows: 

°^=^ = -|'"^""^ P'^' 

The above result shows that the productivity distribution of workers is deter- 
mined by (1) the productivity distribution of firms and (2) the mean demand D. 
This is readily exposed in the continuous notation: Denoting the pdf of the firm's 
productivity by p^'^\c), the pdf of the worker's productivity p^^\c) is given as fol- 
lows; 

p(W)(c) = ^e-'^V^)(c), (3-6) 
where the partition function is 

Z{(3) ■= / e-'^'p^^\c)dc. (3-7) 



The firm's productivity distribution p^^\c) is analogous to the density of energy 
levels. 

Although this theory is rather attractive for its simplicity and elegance, it does 
not meet either of the phenomenological laws I and II we have established above: 
Given that p^^\c) obeys power- law, this theory predicts that p^'^\c) has additional 
exponential dumping, which is far from the reality. We thus proceed to superstatis- 
tics theory build on this platform. 

§4. Fluctuating Aggregate Demand and Superstatistics 

The implicit assumption in the Yoshikawa-Aoki theory that the aggregate de- 
mand D is constant is an oversimplification; the demand fluctuates. 
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Macro-system under fluctuations of external environment may be handled by 
the superstatistics^^ In this theory, the system goes through changing external 
influences, but is in equilibrium within certain limited scale in time and/or space, in 
which the temperature may be regarded as constant and the Boltzmann distribution 
is achieved. In other words, the system is only locally in equilibrium; globally seen, 
it is out of equilibrium. Thus the key concept in the superstatistics is to introduce 
averaging over the Boltzmann factors. 

There are several model-cases where superstatistics was applied successfully. 
Among them, the Brownian motion of a particle going thorough changing environ- 
ment pI'IM such as different temperature and different viscosity both in space and 
time, provides a good analogy to our case. Like a particle in Brownian motion, each 
flrm experiences ever-changing demand, the flrst of which is the aggregate demand 
D. Further, flrms in different business sector or with different product must meet 
their own and individual demand, which is analogous to the special fluctuation of 
environment. Average over these various possible fluctuations could be summarized 
in a form of averaging over the temperature, or in other words, D. Although it is 
dangerous to extend this interpretation too far with a lack of rigorous treatment of 
the associating stochastic process, we would always keep in mind that the following 
discussion of the fluctuation of the aggregate demand may be taken as a symbolic 
representation of many other kinds of fluctuation. 

In superstatistics, the familiar Boltzmann factor e~^'^ is replaced by a weighted 
average: 

/•oo 

B{c) = / e-^'^fp{f3)dP, (4-1) 
Jo 

Here, the weight factor fp^P) represents the changing environment. Note that be- 
cause /3 is a monotonically decreasing function of the mean demand D, the weight 
factor //3(/3) represents fluctuation of D. With this weight factor, the pdf of worker's 



productivity (3-6) is now replaced by the following: 



P^^\c) = ^p(-^\c)B{c). (4-2) 
Here, the partition function Zb is also redeflned as 

POO 

Zb= p^^\c)B{c)dc. (4-3) 
Jo 



Let us now examine whether p^^\c) in Eq.(4-2) obeys the Pareto law for high 



productivity c. As the integration in Eq.(4-1) is dominated by the small /3 (high 
demand) region for large c, the behaviour of the pdf ff^{P) for small /3 is critical. Let 
us assume the following in this range: 

f(3if3)^(3~\ (7<1), (4-4) 

where the constraint for the parameter 7 comes from the convergence of the in- 



tegration in Eq.(4-1). The proportional constant is irrelevant because p^^\c) is 
normalized by Zb- This leads to the following B(c) for large c: 

B{c) (X r{l--f)c^-\ (4-5) 
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Substituting this and the Pareto law for the firm's productivity, 



(4-6) 



into Eq.(4-2 ), we obtain the productivity distribution across workers obeys the Pareto 
law; 

p(W)(c) oc c-'^w-i^ (4.7) 



with 

= /^F - 7 + 1- 

Because of the constraint 7 < 1 , this leads to the inequality 



(4-8) 
(4-9) 



This agrees with our empirical observation II. 

The parameter 7 in the distribution of /? is related to a parameter in distribution 
of D, as P is related to D by Eq.(3-5 ). Therefore, let us now examine the consequence 
of the relation (3-5) for small p. 

§5. (3 and D 

We first note the following three basic properties (i)-(iii). 

(i) The temperature, T = is a monotonically increasing function of the aggre- 
gate demand, D. We can prove it using Eq.(3-5) as follows: 



dD 



1 dD 



d^ 



where (c")^ is the n-th moment of productivity defined as follows: 

1 



/3 



m 



cV^)(c) e-f^^dc. 



(5-1) 



(5-2) 



Note that (c)^ = D. This is a natural result. As the aggregate demand D rises, 
workers move to firms with higher productivity. It corresponds to the higher 
temperature due to the weight factor e~^'^. 
(ii) For T ^ (/3 ^ 00), 

D ^0. (5-3) 



This is evident from the fact that in the same limit the integration in Eq.( |3-7| 
is dominated by c ~ due to the factor e~^^. 
(iii) For T ^ 00 (/3 ^ 0), 



D 



cp 



(^)(c)dc (= (c)o). 



(5-4) 



This can be established based on the property (i) because D = {c)p —> (c)o as 
/3 ^ and Z(0) = 1. 
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Let us now study the small /3 (high temperature) properties. One possible 
approximation for Eq.(3-7) is obtained by expanding the factor e~^'^ = 1 — /3c + • • • 
and carrying out the c-integration in each term. This leads to the following: 



m 



p^^\c) ( l-/?c+^(/3c)" + ... ) dc 



1 



1 



l-(c>o/3+2(c')o/3' + ..., 



where we have used the normalization condition, 



P 



(^)(c) dc = l. 



(5-5) 



(5-6) 



The result (5-5) is, however, valid only for /xp > 2 since (c )o is infinite for fip < 2, 
which is true as we have seen. 

The correct expansion for 1 < < 2 is done in the following way. We first 
separate out the first two terms in the expansion of the factor e~^'^; 



m 



p(^)(c) (l - /3c + (e-/^^ - 1 + /?c)) dc 
l-{c)of3 + Z2{(3), 



dc 



P^^(c))g(c)dc 



'(F). .95(c) 
^> dc 



dc, 



(5-7) 



(5-8) 



where g{c) = e '^'^ — 1 + /3c is a monotonically increasing function of c with 

5(0) = <7'(0) = 0. (5-9) 



The c-integration in Eq.(5-8) is dominated by the asymptotic region of c for small 
(3. Therefore, the leading term in Z2{(3) is evaluated by substituting the asymptotic 
expression of p^^-'(c); 



(5-10) 



into Eq.(5-8). We thus arrive at the following: 

-'^^ dg[c) 



c 

Vco 



dc 



dc + 



flFr{-IJF){coPr^ + 



(5-11) 



The case /ip = 2 can be obtained by taking the limit /Up — > 2+ in the following 
expansion valid for 2 < //p < 3: 



1 



Z(/3) = 1 - (c)o/3 + -((c^)o - (c)^)/3^ + ;Upr(-^p)(co/3)'^^ + 



(5-12) 
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which can be obtained in the manner similar to the above. The third term is finite 
for jUp > 2, but diverges as /ip — > 2+ as 



2cl 



fJ-F 



(5-13) 



This cancels the divergence of the fom'th term in the same limit and the remaining 
leading term is as follows: 



Z{p) = 1 - {c)oP - (co/3)' log(co/3) + . . . . 
In summary, the partition function behaves as follows: 

'l_(c)o/3+i(c2)o/52 + ... 
Z{(3) = h- (c)o/3 - (co/3)2 log(co/3) + . . . 

_ 1 - (c)o/3 + /iFr(-^F)(co/3)^^ + . . 

Substituting the above in Eq.(|3-5|), we obtain the following 



for 2 < /ip; 
for fip = 2; 
for 1 < < 2. 



(5-14) 



(5-15) 



D 



'(c)o-((c2)o-(c)2) /?+.... 

(c)o + 2c2/31og(co/3) + ... 

^(c)o-/iF'r(-^F)C/3^^-^ + ... 



for 2 < //f; 
for ^F = 2; 
for 1 < /iF < 2. 



(5-16) 



§6. Pareto Indices and the Demand 

As the distributions of (3 and D are related by 

f^{P)dl3 = fD{D)dD, 



we find from Eqs.(4-4) and (5-16) that 



fD{D) cc ((c)o - D) 



-5 



with 



7-1 



6 — 1 for 2 < ;Uf; 

{HF-1){6-1) forl<^F<2. 



(6-1) 
(6-2) 
(6-3) 



At /iF = 2, we need additional logarithmic factors for f/s^P), but the power of f3 is 
essentially the boundary case between the above two, 7 = 5. Also, the parameter S 
is constrained by; 

S < 1 (6-4) 

from the normaliz ability of the distribution of foiD), which is consistent with the 
constraint 7 < 1 and Eq.(6-3). 

Combining Eqs.(4-8) and (6-3), we reach the following relation between the 
Pareto indices: 



fiF — 6 + 1 for 2 < ^F 

(/iF - l)!-*^ + 1) + /XF for 1 < /iF < 2. 



(6-5) 
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This relation between nw and /Ui? is illustrated in Figj5j As noted previously, because 
of the constraint 5 < 1, Eq.(6-5) necessarily makes /iw larger than /ip, in good 



agreement with our empirical finding. Incidentally, Eq.(6-5) has a fixed point at 
(/xWjA*f) = (1)1); the line defined by Eq.(6-5) always passes through this point 



irrespective of the value of 6. The Pareto index for firms is smaller than that for 
workers, but it cannot be less than one, because of the existence of this fixed point. 

The superstatistics framework presented above may apply for any adjoining 
levels of aggregation; Instead of applying it for workers and firms, we may apply it 
for firms and industrial sectors. Then, we can draw the conclusion that as we go 
up from firms to industrial sectors, the Pareto index again goes down, albeit for a 
different value of 5. This is illustrated in FigjHj Because of the existence of the point 
(1, 1), as the aggregation level goes up, the Pareto index is driven toward 1, but not 
beyond 1. At the highest aggregation level, it is expected to be close to one. This 
is again in good agreement with our empirical finding that the Pareto index of the 
industrial sector fis is close to one (see Figj4|. 

In summary, the superstatistics framework successfully explains two empirical 
findings we have made. Furthermore, given the measured values of fiwi and fip, the 



relation (6-5 ) can be used to determine the value of S: 



/XF — /Uw + 1 for 2 < fip; 



. - 1 



+ 1 for 1 < /iF < 2. 



(6-6) 



The result is shown in FigjT} Recall that 6 is the power exponent of the distribution 
of aggregate demand, D. Therefore, low S means the relatively low level of the 
aggregate demand. In FigjTjwe observe that the aggregate demand was high during 
the late 1980's, while beginning the early 90's, it declined to the bottom in 2000- 
2001, and then, afterward turned up. It is broadly consistent with changes in the 
growth rate during the period. 
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Aggregation Level 

Fig. 6. Illustration of changes of the Pareto index as the aggregation level changes in two steps, 
each with a different value of 5. 
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Fig. 7. The values of S calculated from Eq.(6-5l for the Japanese listed firms 
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